Robust Target Speaker Tracking in Broadcast TV Streams
نویسندگان
چکیده
This paper addresses the problem of audio change detection and speaker tracking in broadcast TV streams. A two-pass audio change detection algorithm, which includes detection of the potential change boundaries and refinement, is proposed. Speaker tracking is performed based on the results of speaker change detection. In speaker tracking, Wiener filtering, endpoint detection of pitch, and segmental cepstral feature normalization are applied to obtain a more reliable result. The algorithm has low complexity. Our experiments show that the algorithm achieves very satisfactory results.
منابع مشابه
Development of a Speaker Diarization System for Speaker Tracking in Audio Broadcast News: a Case Study
A system for speaker tracking in broadcast-news audio data is presented and the impacts of the main components of the system to the overall speaker-tracking performance are evaluated. The process of speaker tracking in continuous audio streams involves several processing tasks and is therefore treated as a multistage process. The main building blocks of such system include the components for au...
متن کاملA System for Speaker Detection and Tracking in Audio Broadcast News
A system for speaker-based audio-indexing and an application for speaker-tracking in broadcast news audio are presented. The process of producing an indexing information in continuous audio streams based on detected speakers is composed of several tasks and is therefore treated as a multistage process. The main building blocks of such an indexing system include components for an audio segmentat...
متن کاملCASA based speech separation for robust speech recognition
This paper introduces a speech separation system as a front-end processing step for automatic speech recognition (ASR). It employs computational auditory scene analysis (CASA) to separate the target speech from the interference speech. Specifically, the mixed speech is preprocessed based on auditory peripheral model. Then a pitch tracking is conducted and the dominant pitch is used as a main cu...
متن کاملVarious Methods for Visual Speaker Identification for Automatic Continuous Speech Recognition in TV Broadcast Programs
This paper is about different methods and algorithms that were used for speaker identification from the video recordings of TV broadcast news transcription. The information from visual speaker identification were used in our complex system for automatic continuous speech recognition of TV broadcast programs because it is possible to use speaker adapted (SA) Hidden Markov Models (HMMs) if we hav...
متن کاملReliability based budgeting with the case study of TV broadcast
Planning budget will help to identify wasteful expenditures, adapt financial situation changes quickly, and achieve financial goals. The reliability based budgeting has a great importance for broadcasting industry. In this study, several kinds of failure modes in TV broadcasting system have been det...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- IJCLCLP
دوره 11 شماره
صفحات -
تاریخ انتشار 2006